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ABSTRACT 

A major challenge in metabolic engineering and syn- 
thetic biology is to balance the flux of an engineered 
heterologous metabolic pathway to achieve high 
yield and productivity in a target organism. Here, 
we report a simple, efficient and programmable 
approach named 'customized optimization of meta- 
bolic pathways by combinatorial transcriptional en- 
gineering (COMPACTER)' for rapid tuning of gene 
expression in a heterologous pathway under distinct 
metabolic backgrounds. Specifically, a library of 
mutant pathways is created by de novo assembly 
of promoter mutants of varying strengths for each 
pathway gene in a target organism followed 
by high-throughput screening/selection. To demon- 
strate this approach, a single round of COMPACTER 
was used to generate both a xylose utilizing 
pathway with near-highest efficiency and a 
cellobiose utilizing pathway with highest efficiency 
that were ever reported in literature for both labora- 
tory and industrial yeast strains. Interestingly, these 
engineered xylose and cellobiose utilizing pathways 
were all host-specific. Therefore, COMPACTER rep- 
resents a powerful approach to tailor-make meta- 
bolic pathways for different strain backgrounds, 
which is difficult if not impossible to achieve by 
existing pathway engineering methods. 

INTRODUCTION 

Microbial synthesis of value-added compounds typically 
depends upon the creation or introduction of heterologous 
metabolic pathways into the production hosts (1^1). 
Often, in order to enable commercially viable production 
of these compounds, the metabolic flux in the heterol- 
ogous pathways must be optimized to avoid metabolic 



burden from over-expression of certain genes, redox 
imbalance from unmatched cofactor specificity, accumu- 
lation of unstable or toxic intermediates or other bottle- 
necks that result in growth inhibition (5-11). Traditional 
approaches for balancing metabolic flux involve iden- 
tifying bottlenecks in metabolic pathways and 
debottlenecking the pathways. The most common 
debottlenecking strategies include the over-expression of 
key metabolic genes, deletion of competing pathways and 
the improvement of certain catalytic enzymes through 
protein engineering. Traditional approaches usually 
focus on a certain key step in the pathway instead of the 
whole dynamic metabolic network (the network consisting 
of the multi-step heterologous pathway and the overall 
metabolic background of the host strain). As a result, 
most traditional approaches have met with limited 
success in the optimization of multi-step pathways (12). 

Recently, a number of innovative approaches have been 
developed that balance metabolic flux at a global level 
through either perturbation of global transcription ma- 
chinery (13), genomic-scale mapping of fitness altering 
genes (14,15) or multiplex automated genome engineering 
(16). In addition, new strategies have been developed 
to balance the metabolic flux within a target pathway 
by tuning pathway gene expression through engineering 
of the promoters (13), ribosome binding sites (7) and Aller- 
genic regions (5). These new approaches have enabled sim- 
ultaneous optimization of a metabolic pathway to a 
certain degree. However, due to the distinct metabolic 
backgrounds between laboratory strains and industrial 
strains, pathways optimized in laboratory strains may 
not be transferable to industrial production strains (17). 
Therefore, it is highly desirable to simultaneously balance 
and fine-tune the metabolic flux within a heterologous 
pathway based on the metabolic background of the host 
strain. 

To this end, we developed the COMPACTER method 
that is capable of simultaneously optimizing multiple 
genes in a metabolic pathway and tailoring the target 
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pathway to a production host of interest. In this approach, 
a metabolic pathway is introduced into a host strain on a 
single copy vector with each gene under the control of 
a distinct promoter and terminator pair. The use of a 
single copy vector will eliminate the potential issue of 
unstable gene expression levels due to the varying copy 
number of a multi-copy vector and offer a greater flexibil- 
ity compared to an integrative vector because the same 
pathway library assembled on a single copy vector can 
be freely transferred to different strains. The use of a 
distinct promoter and terminator pair for each pathway 
gene is designed to avoid repetitive sequences that may 
result in undesired recombination events in the target 
pathway. Nucleotide analog mutagenesis (18) is used to 
generate a series of promoter mutants of varying 
strengths. These promoter mutants are assembled with 
the metabolic genes of interest into a library of mutant 
pathways with different expression patterns using the 
DNA assembler method (19). In principle, this library 
should contain mutant pathways with all possible combin- 
ations of expression levels present in the promoter 
mutants. A high-throughput screening or selection 
method is then used to identify a mutant pathway with 
a balanced metabolic flux from the library of mutant 
pathways in the target host (Figure 1). 

MATERIALS AND METHODS 

Strains, media and cell cultivation 

Saccharomyces cerevisiae strain INVScl (MATa his3Al 
leu2 trp 1-289 ura3-52 MATa his3Al leu2 trpl-289 



ura3-52) was purchased from Invitrogen (Life 
Technologies, Grand Island, NT, USA). S. cerevisiae 
L2612 (MATa leu2-3 leu2-112 ura3-52 trpl-298 canl 
cynl gai) was a gift from Prof. Yong-su Jin (20). Still 
Spirits (Classic) Turbo Distiller's Yeast was purchased 
from Homebrew Heaven (Everett, WA, USA). 
Escherichia coli DH5a (Cell Media Facility, University 
of Illinois at Urbana-Champaign, Urbana, IL, USA) 
was used for recombinant DNA manipulation. Yeast 
strains were cultivated in either synthetic dropout 
medium (0.17% Difco yeast nitrogen base without 
amino acids and ammonium sulfate, 0.5% ammonium 
sulfate and 0.083% amino acid drop out mix) or YPA 
medium (1% yeast extract, 2% peptone and 0.01% 
adenine hemisulfate) supplemented with sugar as a 
carbon source. E. coli strains were cultured in Luria 
broth (LB; Fisher Scientific, Pittsburgh, PA, USA). 
5". cerevisiae strains were cultured at 30°C and 250 rpm 
for aerobic growth, and 30°C and 100 rpm for oxygen 
limited conditions. E. coli strains were cultured at 37°C 
and 250 rpm unless specified otherwise. All restriction 
enzymes and plasmids, including pRS414, pRS415 and 
pRS416, were purchased from New England Biolabs 
(Ipswich, MA, USA). All chemicals were purchased 
from Sigma-Aldrich or Fisher Scientific. A detailed de- 
scription on fermentation conditions and HPLC analysis 
may be found in the Supplementary Method SI. 

Plasmid and strain construction 

Most of the cloning work was done using yeast homolo- 
gous recombination mediated by the DNA assembler 
method (19). DNA fragments flanked with regions 
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Figure 1. General scheme of the COMPACTER method for combinatorial pathway design. See text for details. 
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homologous to adjacent DNA fragments were generated 
with polymerase chain reaction (PCR) and all DNA frag- 
ments were purified and co-transformed into S. cerevisiae 
along with the backbone. To confirm the correct clones 
from transformants, yeast plasmids were isolated using a 
Zymoprep II yeast plasmid isolation kit (Zymo Research, 
Irvine, CA, USA) and transferred into E. coli. Plasmids 
from E. coli were then isolated and confirmed using diag- 
nostic PCR. A detailed description on creation of 
promoter mutants may be found in the Supplementary 
Method S2. 

Mutant pathway library creation 

To facilitate the construction of pathway libraries, helper 
plasmids were constructed to contain the DNA sequence 
homologous to the upstream of the pathway, a unique 
restriction site, the pathway gene, a terminator and 
DNA sequence homologous to the downstream of the 
pathway. The promoter mutant libraries were then 
cloned into corresponding helper plasmids using the 
DNA assembler method (19). The resulted constructs 
were confirmed using diagnostic PCR and the full gene 
expression cassette consisting of the upstream homolo- 
gous sequence, the promoter mutant, the gene, the termin- 
ator and the downstream homologous sequence were 
amplified using PCR. The full gene expression cassette 
were then mixed together and transferred with a linearized 
single copy shuttle vector to create the pathway library 
(Supplementary Figures S3 and 8). A detailed description 
on mutant pathway library creation may be found in the 
Supplementary Method S3. 

Library screening 

Libraries of xylose utilizing pathways and cellobiose 
utilizing pathways were screened using a multi-tiered 
screening strategy consisting of petri dish plate-based 
screening, culture tube-based screening and shake-flask 
based screening. A detailed description of the library 
screening protocol and subsequent characterization of 
the positive strains by fermentation and HPLC analysis 
may be found in the Supplementary Method S4. 

Quantitative PCR 

Fresh samples of cell cultures harvested at the exponential 
phase grown under the same condition as mentioned 
above for xylose or cellobiose fermentation were used to 
determine the relative expression ratio of metabolic genes 
via quantitative PCR (qPCR). The total RNA was 
isolated using the RNeasy mini kit (QIAGEN, Valencia, 
CA, USA) following the manufacturer's instructions. The 
RNA samples were then reversed transcribed into cDNA 
using the Transcriptor First Strand cDNA Synthesis kit 
using oligo-dT primer following the manufacturer's in- 
structions (Roche, Indianapolis, IN, USA). The qPCR ex- 
periments were carried out using the LightCycler® 480 
system (Roche, Indianapolis, IN, USA) using the 
SYBRGreen-based method following the manufacturer's 
instructions. 



RESULTS 

Creating a highly efficient xylose utilizing pathway in both 
laboratory and industrial Saccharomyces cerevisiae strains 

To demonstrate the utility of the COMPACTER method, 
we first sought to balance the metabolic flux in an engin- 
eered xylose utilizing pathway in Saccharomyces cerevisiae 
for lignocellulosic biofuel production. Xylose is the most 
abundant pentose sugar in hemicellulose hydrolysates, but 
cannot be directly utilized by the most commonly used 
ethanol producer, S. cerevisiae (21). Two types of 
pathways were used to enable xylose utilization in 
S. cerevisiae, the isomerase-based bacterial pathway and 
the reductase-dehydrogenase-based fungal pathway 
(22). The fungal xylose utilizing pathway consists of a 
xylose reductase, xylitol dehydrogenase and xylulokinase 
(Figure 2a). The fungal xylose utilizing pathway was used 
to demonstrate the utility of the COMPATER method, 
because the balancing of enzyme activities and cofactor 
usage is crucial for the performance of the pathway (22). 
Despite three decades of effort in optimizing the xylose 
utilizing pathway in S. cerevisiae for biofuels production 
(21-24), no existing recombinant xylose assimilating 
strains are efficient enough for commercial application. 
This may be due to the low expression level and activity 
of heterologous proteins, redox imbalance resulting from 
different cofactor preferences for oxidation and reduction 
reactions and/or suboptimal metabolic flux through the 
different catalytic steps (25). 

Based on the results from an enzyme screening program 
(Supplementary Figure SI), the xylose reductase from 
Candida shehatae (csXR), the xylitol dehydrogenase 
from Candida tropicalis (ctXDH) and the xylulokinase 
from Pichia pastoris (ppXKS) were selected to construct 
the xylose utilizing pathway in both laboratory and indus- 
trial yeast strains. In the enzyme screening program, 
enzyme activities of 20 xylose reductases, 22 xylitol de- 
hydrogenases and 19 xylulokinases were assayed using 
cell lysates. The csXR, ctXDH and ppXKS were chosen 
because they have the highest activity and closest matched 
cofactor specificity among all tested enzyme homologs. In 
the engineered xylose utilizing pathway, the expressions of 
csXR, ctXDH and ppXKS were driven by PDC1 
promoter mutants, TEF1 promoter mutants and EN02 
promoter mutants, respectively. A library of mutant 
xylose utilizing pathways was generated using pre-selected 
promoters of varying strengths. Briefly, each of the three 
constitutive yeast promoters — PDC1, EN02 and TEF1 — 
in the engineered xylose utilizing pathway was subjected to 
nucleotide analog mutagenesis (26) to create a pool of 
promoter mutants. Of note, nucleotide analog mutagenesis 
was used instead of conventional error-prone PCR 
because it can generate a very high mutation rate upon 
introduction of nucleotide analogs in the PCR reaction, 
which leads to rapid creation of promoter mutants with 
varying strengths. Approximately 10 promoter mutants of 
varying strengths were isolated for each promoter using 
green fluorescent protein (GFP) as a reporter 
(Supplementary Figure S2). The isolated promoter 
mutants were then used in the construction of the 
pathway library (Supplementary Figure S3). 
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Figure 2. Creation of a highly efficient xylose utilizing pathway in both laboratory and industrial 5. cerevisiae strains via COMPACTER. Red circle: 
xylose; blue triangle: ethanol. (a) Scheme of the engineered fungal xylose utilizing pathway, (b) Xylose fermentation of eight randomly picked clones 
from the COMPACTER library, (c) Comparison of xylose consumption and ethanol production in the industrial strain. Open symbol: the reference 
industrial strain CTY-XWT; solid symbol: the mutant industrial strain CTY-X7. (d) Comparison of xylose consumption and ethanol production in 
the laboratory strain. Open symbol: the reference laboratory strain INV-XWT; solid symbol: the mutant laboratory strain INV-X3. (e) Pathway 
switching in the industrial strain. Open symbol: the industrial strain harboring the pathway from the mutant laboratory strain CTY-X3; solid 
symbol: the mutant industrial strain CTY-X7. (f) Pathway switching in the laboratory strain. Open symbol: the laboratory strain harboring 
the pathway from the mutant industrial strain INV-X7; solid symbol: the mutant laboratory strain INV-X3. (g) Relative expression levels of 
XR, XDH and XKS in the mutant industrial strain CTY-X7 and the reference industrial strain CTY-XWT measured using qPCR. The expression 
levels were normalized by making the expression level of XR as 1. (h) Relative expression levels of XR, XDH and XKS in the mutant laboratory 
strain INV-X3 and the reference laboratory strain INV-XWT measured using qPCR. The expression levels were normalized by making the expres- 
sion level of XR as 1. 



To confirm the diversity of the library, eight colonies 
from the laboratory yeast strain INVScl were randomly 
picked for analysis. For these colonies, xylose consump- 
tion rate, ethanol productivity and ethanol yield were all 
different, indicating that the resultant pathway library 
exhibited excellent diversity (Figure 2b). The library was 
then screened using a colony size-based screening method 
(Supplementary Figures S4-7) and fast xylose utilizing 
mutant pathways were identified for both the laboratory 
and industrial strains (Table 1, Figures 2c and d). For 
the xylose utilizing pathway optimized in the laboratory 
strain INVScl, the optimized mutant laboratory strain 
INV-X3 improved the ethanol yield by more than 60% 
(0.25 g/g xylose for INV-X3 versus 0.16 g/g xylose for 
the reference laboratory strain INV-XWT that harbors 
the same xylose utilizing pathway under the control of 
wild-type promoters). Additionally, the INV-X3 labora- 
tory strain consumed xylose 70% faster and produced 
ethanol 1.5 times faster than the reference laboratory 
strain INV-XWT (0.4g/L/h xylose consumption rate and 
0.1 g/L/h ethanol production rate; Table 2). More impres- 
sively, after only one round of COMPACTER in the 
industrial strain 'Classic Turbo Yeast', the optimized 
mutant industrial strain CTY-X7 exhibited a xylose con- 
sumption rate of 0.92 g/L/h with an ethanol yield of 



0.26 g/g xylose, which is close to the fastest xylose utilizing 
strain reported in literature (27) (Table 2, Figures 2c). In 
contrast, the reference industrial strain harboring the same 
xylose utilizing pathway under wild-type promoters 
(CTY-XWT) consumed less than 9% of the total xylose 
and produced no ethanol within 88 h. 

The top 10 optimized xylose utilizing laboratory or in- 
dustrial strains were then isolated and the expression levels 
of XR, XDH and XKS were determined using quantita- 
tive PCR (qPCR). The mutant laboratory strain INV-X3 
exhibited a higher expression ratio of XDH/XR and XKS/ 
XR when compared to the reference laboratory strain 
INV-XWT (more than 1-fold higher XDH/XR ratio and 
almost 4-fold higher XKS/XR ratio; Figure 2h). 
Moreover, the top 10 mutant laboratory strains all ex- 
hibited higher relative expression ratio of XKS over XR 
(4.1 < XKS/XR < 19.3) when compared with the reference 
laboratory strain INV-XWT (XKS/XR- 2), while nine 
out of the top 10 mutant laboratory strains also exhibited 
a higher relative expression ratio of XR over XDH 
(Supplementary Figure SI 4a). Similarly, the top 10 
optimized industrial strains also exhibited very distinct 
expression levels of XR, XDH and XKS compared to 
the reference industrial strain CTY-XWT. The top 
10 mutants all exhibited a much lower relative expression 
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Table 1. List of strains and relevant xylose and cellobiose utilizing pathways used in this study 



Strain 



Host 



Pathway 



Note 



INV-XWT INVScl 



INV-X3 


INVScl 


INV-X7 


INVScl 


CTY-XWT 


Classic Turbo Yeast 


CTY-X3 


Classic Turbo Yeast 


CTY-X7 


Classic Turbo Yeast 


INV-CWT 


INVScl 


INV-C3 


INVScl 


INV-C59 


INVScl 


CTY-CWT 


Classic Turbo Yeast 


CTY-C3 


Classic Turbo Yeast 


CTY-C59 


Classic Turbo Yeast 



XWT: xylose pathway under wild type promoters 
(PDClp for csXR, TEFlp for ctXDH, EN02p for 
ppXKS) 

X3: Mutant xylose pathway optimized in the INVScl 
strain 

X3: Mutant xylose pathway optimized in the Classic 

Turbo Yeast strain 
XWT: xylose pathway under wild type promoters 
X3: Mutant xylose pathway optimized in the INVScl 

strain 

X3: Mutant xylose pathway optimized in the Classic 

Turbo Yeast strain 
XWT: cellobiose pathway under wild type promoters 

(PDClp for ghl-1, EN02p for cdt-1) 
X3: Mutant cellobiose pathway optimized in the 

INVScl strain 
X3: Mutant cellobiose pathway optimized in the 

Classic Turbo Yeast strain 
XWT: cellobiose pathway under wild type promoters 

(PDClp for ghl-1, EN02p for cdt-1) 
X3: Mutant cellobiose pathway optimized in the 

INVScl strain 
X3: Mutant cellobiose pathway optimized in the 

Classic Turbo Yeast strain 



Reference laboratory strain, xylose pathway 

Laboratory strain with an optimized xylose 
pathway 

Laboratory strain with a switched xylose pathway 

Reference industrial strain, xylose pathway 
Industrial strain with a switched xylose pathway 

Industrial strain with an optimized xylose pathway 

Reference laboratory strain, cellobiose pathway 

Laboratory strain with an optimized cellobiose 
pathway 

Laboratory strain with switched cellobiose 
pathway 

Reference industrial strain, cellobiose pathway 

Industrial strain with a switched cellobiose 
pathway 

Industrial strain with an optimized cellobiose 
pathway 



DNA sequences of promoter mutants in the optimized pathways are listed in Supplementary Information SI. 

Table 2. Xylose fermentation profiles of the optimized and reference strains compared with top xylose fermenting strains in literature 



Host strain 



Laboratory 
strain (INVScl) 



Industrial strain (Classic Turbo Yeast) 



D452-2 



IR-2 



IR-2 





INV-XWT 


INV-X3 


CTY-XWT 


CTY-XWT 


CTY-X7 


CTY-X7 


CTY-X7 


DA24-16 (27) 


MA-R4 (31) 


MA-R5 (31) 


Seed culture 


SCD U 


SCD a 


YPD b 


YPX C 


YPD b 


YPX C 


YPX C 


YPD b 


YPD b 


YPD b 


Initial OD 


1 


1 


10 


2 


10 


2 


10 


1 


2.8 g DCW/L 


2.8 g 






















DCW/L 


Xylose consumption rate 


0.24 


0.40 


0.06 


0.03 


0.74 


0.73 


0.92 


1.06 d 


1.07 


1.29 d 


(g xylose/L/h) 






















Ethanol productivity 


0.04 


0.10 


0 


0 


0.17 


0.17 


0.24 


0.35 d 


0.36 


0.50 d 


(g ethanol/L/h) 






















Yield (g ethanol/g xylose) 


0.15 


0.25 


0 


0 


0.24 


0.23 


0.26 


0.31 -0.33 d 


0.34 


0.37 d 



The reference strains are csXR, ctXDH and ppXKS driven by wild-type PDC1, TEF1 and EN02 promoters in corresponding strains. 
a SC-Ura supplemented with 2% glucose. 
b YP medium supplemented with 2% glucose. 
C YP medium supplemented with 2% xylose. 

d Calculated approximately according to figure shown in corresponding manuscript. 



ratio of XDH/XR and XKS/XR when compared to the 
reference industrial strain CTY-XWT (Supplementary 
Figure S14b). Specifically, the CTY-X7 industrial strain 
exhibited a relative expression ratio of XDH/XR almost 
two orders of magnitude lower when compared with the 
reference industrial strain CTY-XWT and an expression 
ratio of XKS/XR more than eight times lower (Figure 2g). 

Creating a highly efficient cellobiose utilizing 
pathway in both laboratory and industrial 

Saccharomyces cerevisiae strains 

To demonstrate the generality of the COMPACTER 
approach for pathway engineering, we sought to optimize 



the cellobiose utilizing pathway that we recently con- 
structed (28). Cellobiose, a disaccharide with two glucose 
molecules linked in a (3(1^4) bond, is an intermediate 
product from the enzymatic hydrolysis of cellulose (28). 
The cellobiose utilizing pathway consists of a cellobiose 
transporter and an intracellular P-glucosidase for the 
uptake and complete hydrolysis of cellobiose (Figure 3a). 
Fermentation of cellobiose rather than glucose represents 
a novel strategy for sugar utilization that has been shown 
to alleviate glucose repression and improve the co- 
fermentation of mixed sugars (27,28). 

To optimize the cellobiose utilizing pathway, the 
cellobiose transporter (CDT) gene (cdt-1) and the 
(3-glucosidase (BGL) gene from Neurospora crassa {ghl-1) 
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Figure 3. Creation of a highly efficient cellobiose utilizing pathway in both laboratory and industrial S. cerevisiae strains via COMPACTER. Red 
circle: cellobiose; black square: OD (A 60 o): blue triangle: ethanol. (a) Scheme of the engineered cellobiose utilizing pathway, (b) Library screening on 
an YPAC agar plate, (c) Comparison of cellobiose consumption and ethanol production in 250 ml flask fermentations in the industrial strain. Open 
symbol: the reference industrial strain CTY-CWT; solid symbol: the mutant industrial strain CTY-C59. (d) Comparison of cellobiose consumption 
and ethanol production in 250 ml flask fermentations in the laboratory strain. Open symbol: the reference laboratory strain INV-CWT; solid symbol: 
the mutant laboratory strain INV-C3. (e) Pathway switching in the industrial strain. Open symbol: the industrial strain harboring the pathway from 
the mutant laboratory strain CTY-C3; solid symbol: the mutant industrial strain CTY-C59. (f) Pathway switching in the laboratory strain. Open 
symbol: the laboratory strain harboring the pathway from the mutant industrial strain INV-C59; solid symbol: the mutant laboratory strain INV-C3. 
(g) Relative expression levels of CDT and BGL in the mutant industrial strain CTY-C59 and the reference industrial strain CTY-CWT measured 
using qPCR. The expression levels were normalized by making the expression level of CDT as 1 . (h) Relative expression levels of CDT and BGL in 
the mutant laboratory strain INV-C3 and the reference laboratory strain INV-CWT measured using qPCR. The expression levels were normalized by 
making the expression level of CDT as 1 . 



were assembled into a single copy expression vector under 
the control of the EN02 and PDC1 promoters, respect- 
ively. A library of cellobiose utilizing pathways derived 
from combinations of 10 EN02 promoter mutants and 
11 PDC1 promoter mutants were assembled in the indus- 
trial and laboratory S. cerevisiae strains. Similar to the 
above-mentioned xylose utilizing strains, the resultant 
mutant strains exhibited drastically different cellobiose 
fermentation abilities due to dissimilar expression levels 
of the sugar transporter and p-glucosidase (Supplementary 
Figure S9). The strains harboring the pathway library were 
screened using a colony size-based screening method 
(Supplementary Figures S10-13) and fast cellobiose 
utilizing mutant strains were identified for both laboratory 
and industrial strains (Table 1 and Figures 3c and d). For 
the 'Classic Turbo Yeast' industrial strain, the optimized 
mutant industrial strain CTY-C59 exhibited a 5.4-fold 
higher cellobiose consumption rate and a 5.3-fold higher 
ethanol productivity of 0.74g/L/h, when compared to 
the reference industrial strain CTY-CWT, which 
harbored the same cellobiose utilizing pathway under the 
control of the wild-type promoters (0.39g/L/h cellobiose 
consumption rate and 0.14g/L/h ethanol production rate). 
Similarly, for the INVScl laboratory strain, the optimized 
mutant laboratory strain INV-C3 exhibited a 2.1 -fold 



higher cellobiose consumption rate (1.50g/L/h) and a 
2.3-fold higher ethanol productivity (0.37g/L/h) when 
compared to the reference laboratory strain INV-CWT 
that harbored the same cellobiose utilizing pathway 
under the control of the wild-type promoters (Table 3). 

After analyzing the promoter mutants present in all 
optimized strains, it was observed that all of the five 
cellobiose utilizing mutant pathways in the INVScl 
strain are identical, consisting of the same EN02 and 
PDC1 promoter mutants. Eight of the 10 cellobiose 
utilizing mutant pathways in the 'Classic Turbo Yeast' 
strain contained the same ENO promoter mutant. The 
expression levels of cdt-1 and ghl-1 in the top 10 
optimized pathways of the industrial strain and the top 
five optimized pathways in the laboratory strain were also 
determined and compared to the reference pathways. The 
optimized mutant laboratory strain 1NV-C3 exhibited a 
relative expression ratio of BGL/CDT one order of mag- 
nitude lower than the reference laboratory strain (BGL/ 
CDT -0.6 for INV-C3 and BGL/CDT -6.6 for INV- 
CWT; Figure 3h). Similarly, the optimized mutant indus- 
trial strains all exhibited a relative BGL/CDT expression 
ratio much lower than the reference industrial strain 
CTY-CWT (optimized strains 0.5 < BGL/CDT < 1.8, ref- 
erence strain BGL/CDT ~ 31; Supplementary Figure SI 5). 
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Table 3. Summary of cellobiose fermentation profiles 



(Classic)-125 (Classic)-250 (INVScl)-125 (INVScl)-250 D452-2 



CTY-CWT CTY-C59 CTY-CWT CTY-C59 INV-CWT INV-C3 INV-CWT INV-C3 Ha el al. (27) 



Cellobiose consumption rate 0.36 1.60 0.39 2.18 0.60 1.54 0.7 1.5 1.67 

(g cellobiose/L/h) 

Ethanol productivity 0.14 0.65 0.14 0.74 0.16 0.51 0.16 0.37 0.7 

(g ethanol/L/h) 

Yield (g ethanol/g cellobiose) 0.42 0.44 0.37 0.39 0.32 0.37 0.23 0.27 0.42 



Two different shake-flasks, 125 and 250 ml, were used in fermentations. The fermentation behavior of the mutant strains constructed in this study 
was compared to the fastest cellobiose fermenting strain reported in the literature. 



Specifically, the CTY-C59 strain exhibited a relative ex- 
pression ratio of BGL/CDT more than 30 times lower 
when compared to the reference industrial strain 
CTY-CWT (Figure 3g). 

Engineered xylose or cellobiose utilizing pathways 
are host-specific 

During the sequence analysis of optimized xylose or 
cellobiose utilizing pathways, we noticed that the 
optimized expression patterns of pathways with identical 
metabolic genes in different strain backgrounds differed 
drastically. To investigate whether the engineered xylose 
or cellobiose utilizing pathways are dependent on the 
strain backgrounds, the most highly optimized mutant 
pathways found in the laboratory and industrial strains 
were exchanged. The resulting pathways showed different 
fermentation performances in a new strain background, 
indicating that the optimized pathways are strain-specific 
(Figures 2e and f and 3e and f). Furthermore, the relative 
expression levels of genes involved in the target pathways 
measured using qPCR also confirmed the optimal expres- 
sion profile of the same heterologous pathway was indeed 
different in distinct strain backgrounds (Figures 2g and h 
and 3g and h). 

As indicated by the pathway exchange experiment and 
the qPCR measurement, the pathways generated using 
COMPACTER were truly strain-specific. Surprisingly, the 
optimal expression pattern for a balanced metabolic flux 
through the xylose utilizing pathway differed drastically 
in laboratory and industrial strains. In the laboratory 
strain, a higher expression level of XDH than XR was 
required for efficient xylose assimilation. In the industrial 
strain, however, a higher expression level of XR than XDH 
was optimal. The distinct expression patterns of optimized 
pathways indicated the context-dependency of the heterol- 
ogous pathways in different metabolic backgrounds. 

DISCUSSION 

Engineering microorganisms to produce chemicals and 
fuels by introducing heterologous metabolic pathways is 
a conceptually simple task (1^4), yet the challenge of 
balancing metabolic flux through these pathways in 
order to maximize product titers and productivities 
remains overwhelming (6-11). Previously, two groups at- 
tempted to optimize a heterologous metabolic pathway 
using a combinatorial approach (29,30). However, both 



studies were limited to a very small library of metabolic 
pathways, one in a prokaryotic system (29) and the other 
in yeast using an inefficient in vitro cloning method (30). In 
contrast, the high efficiency of the COMPACTER method 
enables the creation of a large library of multi-step meta- 
bolic pathways through a single yeast transformation, 
which significantly increases the chance of obtaining an 
optimized pathway with balanced metabolic flux. In 
addition, unlike the previous study, in which a single 
promoter library was used to study the biological effect 
of varying expression levels of a target gene in a pathway 
(26), the COMPACTER method creates a library of meta- 
bolic pathways using a pool of promoters for each 
pathway gene so that the metabolic flux of the entire 
pathway can be rapidly optimized. 

Herein, we have demonstrated that the COMPACTER 
method is an efficient approach to tailor-make pathways 
for biofuels production from lignocellulosic biomass inde- 
pendent of knowledge concerning previous relevant meta- 
bolic engineering studies. In a single round, a recombinant 
xylose utilizing industrial strain was constructed that dis- 
played 69% of the xylose consumption rate of the fastest 
reported xylose utilizing strain (making it ranked 4th 
fastest overall; Table 2). Similarly, after a single round 
of COMPACTER, a recombinant cellobiose utilizing in- 
dustrial strain was constructed that exhibited the highest 
cellobiose consumption rate and ethanol productivity ever 
reported in literature (Table 3). 

The use of the newly developed DNA assembler method 
enabled the rapid creation of a mutant pathway library 
with varying gene expression levels through a single step 
of yeast transformation (19). The efficiency and diversity 
of the mutant pathway library achieved by DNA assem- 
bler was previously found to be excellent by DNA- 
sequencing analysis of 40 randomly selected individual 
constructs (32). In this study, the diversity of the 
pathway library was further investigated by checking the 
fermentation performance of randomly selected pathway 
mutants. The distinct fermentation behaviors of the 
mutant pathways from both the xylose and cellobiose 
utilizing pathway libraries indicated that the overall 
library diversity was also excellent (Figure 2b and 
Supplementary Figure S9). 

Both the xylose and cellobiose utilizing pathway 
libraries were screened using a colony size-based screening 
method to identify positive mutants with desired pheno- 
type. Since both of the target phenotypes can be linked 
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with cell viability on certain type of carbon sources, the 
colony size on an agar plate with a specific sugar as a sole 
carbon source gives a very good indication of the effi- 
ciency of the mutant pathway (Figure 3b). A set of large 
colonies were isolated and a culture tube-based screening 
was then performed to identify mutant pathways with 
improved fermentation ability. During the screening of 
50 large colonies isolated from the xylose utilizing 
mutant pathways in the 'Classic Turbo Yeast' strain, we 
confirmed that the specific growth rates of the strains 
correlated very well with the xylose consumption rates 
and ethanol production (Supplementary Figure S5). 

After isolation of the positive mutant pathways, the top 
mutant pathways for each target pathway were analyzed 
to determine the optimized expression pattern. For the 
xylose utilizing pathway, the fact that not all the 
pathway steps in our target pathways were equally sensi- 
tive to the change of promoters indicated that there were 
multiple points of optimality (Supplementary Figure S14). 
In addition, we noticed from our study that the relative 
expression levels (the ratio of expression levels) affected 
the performance of the pathway more than the absolute 
expression levels. To confirm that the isolated mutant 
pathway was truly optimal, the PDC1 promoter in the 
INV-X3 mutant was replaced by all other nine PDC1 
promoter mutants. As we expected, the nine altered con- 
structs all showed lower specific growth rates and xylose 
consumption rates compared to the INV-X3 strain, which 
indicated that the optimal mutant pathway identified was 
truly optimal (Supplementary Figure SI 9). For the 
cellobiose utilizing pathway, all the top five mutant 
pathways isolated from the INVScl strain contained the 
same EN02 and PDC1 promoter mutants. The fact that 
the highly diversified pathway library converged into a 
single solution after screening indicated that the selected 
pathway was highly optimized. For the 'Classic Turbo 
Yeast' strain, eight of the 10 final selected strains con- 
tained the same EN02 promoter mutant, whereas the 
top 10 strains performed similarly even though different 
PDC promoters were present in the mutant pathways. 
This also indicated there were multiple points of optimal- 
ity. However, when the top three mutant pathways from 
the initial screening were analyzed more carefully in shake 
flasks, the mutant pathway CTY-C59 outperformed the 
other two mutant pathways. This clearly showed that 
switching one promoter in the optimized pathway will 
yield a sub-optimal pathway, which supports the conclu- 
sion that the selected pathway was indeed optimized 
(Supplementary Figure Sll). 

The highly mobile pathway libraries constructed using 
the DNA assembler method enabled rapid pathway con- 
struction and optimization in different strain back- 
grounds. As a result, optimization from the same 
pathway libraries in the laboratory and industrial yeast 
strains yielded different optimized pathways for both the 
xylose and cellobiose utilizing pathways. The distinct 
optimized pathways for the laboratory and industrial 
strains were confirmed by exchanging the plasmids har- 
boring the optimal pathways and qPCR. As expected, the 
optimized strains all outperformed the switched strains, 
indicating that the optimized pathways were highly host 



specific. The effect of pathway switching was especially 
drastic for the xylose utilization pathway: the CTY-X3 
strain exhibited a very poor xylose fermentation 
ability compared to the CTY-X7 strain (Table 1 and 
Figure 2e). qPCR analysis of these strains indicated the 
optimal XR to XDH expression ratios were very different 
in the laboratory and industrial strains (Figures 2g and h). 
Although the context-dependent nature of heterologous 
pathways in different host strains has been reported (17), 
this is the first comprehensive and quantitative report of 
such context-dependency using two important heterol- 
ogous pathways for lignocellulosic biofuel production. 
The dependence of pathway optimization on a particular 
strain may result from differences in expression levels of 
endogenous genes involved in the pathway, different 
behavior of the same promoter under different strain 
backgrounds, the overall availability of cofactors or 
stress responses. Since the promoter mutants were 
characterized only in the laboratory strain, qPCR was 
performed to check the expression levels of same 
pathways in different strain backgrounds. The xylose 
utilizing mutant pathway optimized in the 'Classic 
Turbo Yeast' strain (X7) exhibited similar expression 
ratios in both the INVScl (INV-X7) and 'Classic Turbo 
Yeast' strains (CTY-X7). The expression ratios of the 
INV-X7 and CTY-X7 strains were completely different 
from the CTY-XWT and CTY-X3 strains. The fact that 
CTY-X7 and INV-X7 exhibited a similar expression ratio 
while CTY-WT, CTY-X3 and CTY-X7 exhibited very dif- 
ferent expression ratios indicated that the expression ratio 
difference was more affected by the promoter mutants 
in the pathway than the background of the host strain 
(Table 1 and Supplementary Figure SI 6). The enzyme 
activity levels in the different xylose utilizing strains were 
also analyzed to determine the origin of the host- 
specificity of the optimized pathways. The optimized 
pathways showed very different enzyme activities 
compared to the control pathways consisting of 
wild-type promoters. In Supplementary Figure S14, the 
optimized pathway in the INVScl strain (INV-X3) 
would require a higher expression of XDH while the 
optimized pathway in 'Classic Turbo Yeast' strain 
(CTY-X7) would require a higher expression of XR. The 
enzyme activity results were consistent with the expression 
level: the CTY-X7 strain did exhibit a higher XR activity 
and lower XDH activity compared to the INV-X3 strain, 
which would result in a higher XR to XDH ratio 
(Supplementary Figure SI 7). The enzyme activity results 
indicated that the optimization of the pathway not only 
happened at the expression level, but it also affected the 
enzyme activities in the resulted strains. In other words, 
the optimized pathway that performed better in a specific 
strain background was due to the interaction of optimized 
promoters and the metabolic background of the host 
strain. The introduction of promoter mutants simultan- 
eously at multiple steps in the pathway altered the 
overall expression levels, which consequently changed 
the translation level and protein folding of the heterol- 
ogous enzymes. As a result, the simultaneous optimization 
made the heterologous pathway fit better in the specific 
strain background and generated recombinant strains with 
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better performance. The fact that the optimized strains 
differed in so many different aspects further demonstrated 
the power of the strategy described in this study — the 
power of rapid generation and examination of a 
large number of mutant pathways through a single 
step of in vivo pathway optimization (Supplementary 
Figures S17 and 18). Within this context, it may be very 
difficult to reverse engineer an industrial S. cerevisiae pro- 
duction strain using knowledge gained from the engineer- 
ing of the simplified model strains used in laboratories. As 
a corollary, the ability of COMPACTER to tailor-make 
metabolic pathways rapidly in different strain back- 
grounds is truly advantageous in pathway engineering. 

Finally, due to its simplicity and versatility, the 
COMPACTER method can be used for simultaneous op- 
timization of gene expression levels in any metabolic 
pathway. It can rapidly optimize microbial cell factories 
for the production of many useful compounds such as 
fuels, platform chemicals and pharmaceuticals. In 
addition, it can be extremely effective at generating meta- 
bolic pathway libraries, gene circuits, gene switches, syn- 
thetic regulatory networks and even artificial 
chromosomes with varying expression patterns for appli- 
cations in metabolic engineering and synthetic biology. 
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